Extremum statistics in scale-free network models 
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We investigate the statistics of the most connected nodes in scale-free networks. For a scale- 
free network model with homogeneous nodes, we show by means of extensive simulations that the 
exponential truncation — due to the finite size of the network — of the degree distribution governs 
the scaling of the extreme values. We also find that the distribution of maxima obeys scaling and 
follows the Gumbel statistics. For a scale-free network model with heterogeneous nodes, we show 
that scaling no longer holds and that the truncation of the degree distribution no longer controls 
the maximum distribution. Moreover, we find that neither the Gumbel nor the Frechet statistics 
describe the data. 
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The statistics of extrema is a classical subject of great 
interest in mathematics and physics In physics, ex- 
treme events have been studied in a number of fields, 
including self-organized fluctuations and critical phenom- 
ena material fracture ||, disordered systems at low 
temperatures Q, and turbulence ||. Knowledge of ex- 
treme event statistics is also of fundamental importance 
to predict and estimate risk in a variety of natural and 
man-made phenomena such as earthquakes, changes in 
climate conditions, floods Q, and large movements in fi- 
nancial markets H . A new field where extreme statistics 
is of interest is complex networks pi. For one partic- 
ular class of complex networks |j| — scale-free networks 
p0| , p"T — it is well known that the most connected nodes 
strongly influence the dynamics of the system, playing a 
fundamental role in many different phenomena such as 
Internet response to attacks Jl2| , spreading of epidemics 
iPI , or propagation of email virus |fy|| . Surprisingly, so 
far there has been no attempt to characterize the distri- 
bution of extreme connectivities in scale-free networks. 

An important result in extreme statistics is that 
the distributions of maxima for independent identically- 
distributed (iid) random variables fall onto a small num- 
ber of universality classes jj. Let C = {fci, k 2 , k$} 
be a set of iid variables drawn with probability density 
function p(k). The distribution p(K) of the maximum 
K in the set C is dictated by the asymptotic behavior of 
the tail of p(k) JjJ. Specifically, p(K) converges to the 
Gumbel distribution, 



p(K) — a exp(— u 



(1) 



where u = a(K — b), whenp(fc) decays faster than a power 
law; and to the Frechet distribution, 



p(K) = aK-( a+ V exp(-^- Q ), 



(2) 
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when p(k) decays as £r( Q+1 > JT|, 

Unlike the case of iid random variables, little is known 
when correlations are present among the variables ki 
j|, [Tt]]. Even though the universality classes of uncor- 
rected and correlated variables may not necessarily be 
the same, correlated systems have been generall y s tud- 
ied under the framework of iid extreme statistics flSfl . In 
the case of scale-free networks, which we consider in this 
Letter, correlations are present in the degrees ki (i.e. the 
number of links) of the nodes |h| . 

To investigate the extreme statistics in scale-free net- 
works, we consider here the fitness model of Ref [ pfj[ . 
This model is a generalization of the scale-free model 
of Ref. , in which nodes have heterogeneous fitnesses 
{rji; i = 1, S}. The fitness m models an inherent qual- 
ity of the node i that "weighs" its attractiveness to new 
links. In this model, the network starts with so nodes, 
each with sq — 1 links. At time t, a new node is added 
to the network and establishes so — 1 new links. A new 
link is established with a node i, from the set of the 
t — 1 + so existing nodes, with a probability proportional 
to the node degree ki and fitness ?/; 



IT = 



kiTji 



(3) 



This mechanism, typically denoted "preferential attach- 
ment," drives the network to a degree distribution that 
decays in the tail as a power law |lC| ] . In the homogeneous 
case, rji = 1 for all i, one recovers the original model of 
Ref. [I0|, and generates a network with cumulative de- 
gree distribution that decays as P{k) ~ k~ 2 |l(J. In the 
other case we study here, the fitnesses rji are drawn from 
a uniform distribution r}% e [0,1]. This case generates 
a network with a cumulative degree distribution of the 
form P(k) - k- a /\og(k), with a = 1.225 gjj. 

For the case of nodes with homogeneous fitness — i.e., 
all r/i are equal — we find that the distribution of maxima 
obeys Gumbel statistics. This is a surprising result for 
two reasons: (i) the degrees ki are not iid variables and 
are not equally distributed — hence, there is not an a pri- 
ori justification to expect that one of the two universality 
classes (1) or (2) will hold — and (ii) the distribution p(k) 
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FIG. 1: (a) Cumulative degree distribution for the case r]i = 1, 
corresponding to the scale- free model of Ref. E3]. The cumu- 
lative distribution decays as a power-law with exponent a = 2, 
followed by an exponential truncation. The inset shows the 
data collapse obtained by the rescaling S a8 P oc (fc/S 19 ) Q , 
where 9 — 1/a = 0.50 ± 0.03 is the exp onent controlling the 
onset of the exponential truncation |22], |23j] . (b) Distribution 
of the maximum degree. We generate 10° network realizations 
for each size S and confirm that the results are indistinguish- 
able from those obtained for 10 4 network realizations. In all 
simulations the initial size is sq — 2. We plot the distribu- 
tion functions of the scaled maximum K' = S ' K, where 
7 = 0.50 ± 0.03. As expected, the distribution of maxima 
displays the same scaling as the truncation of the degree dis- 
tribution i.e., 7 w 0. Also shown is the fittings of the data to 
the Gumbel with u = —2.1(K' — 1.6), and Frechet distribu- 
tions with a = 2. The maximum statistics agrees well with 
the Gumbel distribution for K' < 5. 
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FIG. 2: (a) Cumulative degree distribution for the model of 
Ref. |20j with uniform fitness distribution. In this case, the 
distributions decay with an exponent a = 1.255 with loga- 
rithmic corrections ]2(J . Note that this result is different from 
the results of Fig. 2a of |5o| which shows a plateau instead 
of an exponential truncation. The inset shows the curves 
rescaled in the same way as in Fig. [j](a), but with an expo- 
nent 9 — 0.76 ± 0.05. (b) Data collapse for the maximum 
degree distribution, obtained from 10 5 network realizations 
for each size. We use the rescaling relation K' = S ' K, 
with 7 = 0.7 ±0.1 to collapse the data. The thin dotted and 
dashed lines are the fitting of the data for the Gumbel with 
u = -2.5(K' - 0.75), and Frechet with a = 1.255. For this 
case the curves do not collapse well. On the contrary, the dis- 
tributions become broader as the network grows, appearing 
to converge to the Frechet distribution as S increases. 



decays as a power law — hence, one would more likely ex- 
pect to find Frechet statistics. For the case of nodes with 
heterogeneous fitness, we find (i) absence of scaling, i.e., 
the shape of the distribution changes with the network 
size, and (ii) that the distribution of maximum, for finite 
network sizes, is not consistent with either of the univer- 
sality classes represented by Eqs. ([!]) and (||). However, 
our results are consistent with the possibility that the 
distribution of maxima may converge to the Frechet dis- 
tribution in the thermodynamic limit. We show that the 
absence of scaling for the heterogeneous fitness case is 
due to the progressive entry in the system of nodes with 
larger fitness that eventually become the new maxima. 

First we consider the case rji — 1 for all i. The distri- 



bution of the maximum degree K is non trivial because 
(i) the degrees ki display a constraint on the total num- 
ber of links, and (ii) the variables ki are not identically 
distributed. Indeed, recent studies have shown that each 
node has a different probability distribution for its degree 
Pi(ki) obeying an exponential form with a characteristic 
scale that depends on the square root of the node index 

Figure |l](a) shows the cumulative degree distribution, 
— 12k'>kP(k'), f° r different network sizes S for the 
homogeneous case, r\i — 1. The curves where obtained 
by numerically iterating the rate equation proposed in 
Ref p2 | . As expected, the curves display a power law de- 
cay, P(k) ~ k~ a , with a = 2, followed by an exponential 
truncation. The inset shows the data collapse obtained 
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FIG. 3: Distribution of the fitness r/max of the node with 
the maximum degree. As expected, the node with the 
largest fitness eventually becomes the most connected dur- 
ing the growth of the network. The inset shows the col- 
lapse of the data performed under the transformation \ = 

(1 - Vmax) log S. 

by rescaling all curves according to 

where 9 = 0.50 ± 0.03 = 1/a = 1/2 is the exponent 
that governs the onset of the exponential truncation p3| . 
Figure 0(b) shows the distribution of maximum degree 
rescaled as K' = S~^K, with 7 = 0.50 ± 0.03. Also 
shown in Fig. |l|(b) are the best fittings of the data to the 
Frechet and Gumbel distributions. It is visually appar- 
ent that the Gumbel distribution describes the maximum 
statistics surprisingly well for K' < 5 [ pi[ . 

This is a surprising result for two reasons: First, we 
find Gumbel statistics even though the degrees ki are not 
iid variables and are not equally distributed. Hence, our 
results appear to indicate that in this case these facts 
do not affect the maximum statistics. Second, we find 
Gumbel statistics even though p(k) decays as a power law 
which would lead one to expect Frechet statistics. The 
reason for not finding Frechet statistics is the fact that 
the network growth process generates finite size networks. 
This contrasts with the procedure when drawing a set 
of S variables from a given distribution. Specifically, if 
one draws ten samples of size 1000, this is no different 
from drawing a single sample of size 10,000. In contrast, 
in the case of a scale-free network, there is a limit for 
the maximum degree possible P5[ that is controlled by 
the exponent 9; i.e., K ~ S e ]22|, g3fl. Hence, for scale- 
free networks, generating ten networks of size 1000 leads 
to a statistically different set of fc^'s than generating one 
network of size 10,000. In the former case K ~ 1000 5 « 
30, while in the latter K ~ 10000 5 « 100. Because for 
scale- free networks there is an exponential truncation due 
to finite network size of p(k), it is natural that we find 
Gumbel statistics of the maxima and that 9 = 7. 

We next consider the case with uniform distribution 
of fitnesses. As before, we study the degree distribution 
and the maximum statistics [p6| . As shown in Fig 0(a) , 
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FIG. 4: The results shown in this figure correspond to a test 
with the fitness model where we arbitrarily set the two first 
nodes present in the beginning of the dynamics to have r\ = 1. 
(a) Cumulative degree distribution. The data collapse shown 
in the inset has been obtained in the same fashion as Figs. |l](a) 
and |(a), with a = 1.27±0.05 and 9 = 0.78±0.04. (b) Distri- 
bution of maximum degree averaged over 10 5 network realiza- 
tions. The rescaling relation K' = S~~'K, with 7 = 0.78±0.04 
has been used. The scaling exponent is the same that we 
found for the degree distribution in this case. This supports 
the hypothesis that the "anomalous" behavior of the extreme 
statistics in the fitness model is due to the introduction of a 
vertex with large fitness in the network which subsequently 
overtakes the position of the highest-degree node. The thin 
dotted line is the best fit to the data of the Gumbel distribu- 
tion with u = -A(K' - 0.76). 



the cumulative degree distributions follow the expected 
scaling and display, as in the previous case, an exponen- 
tial truncation that scales as S with 9 = 0.76 ± 0.05. 
In Fig. ^(b) we show the rescaled distributions of max- 
imum degree for different network sizes {K' = S 1 K) 
with 7 = 0.7 ±0.1. Although the data collapse is not per- 
fect, the estimates for the exponents 9 and 7 are within 
statistical uncertainties and in agreement with the con- 
jectured value 9 = 1/a = 1/1.255 w 0.786 f|, EH- 

In Fig. ||(b) we also show the best fitting to the data 
of the Gumbel and Frechet distributions. The results do 
not follow any of the two classical distributions, but the 
curves appear to converge to the Frechet distribution as 
S increases. 

In order to understand the effect of fitness in the 
growth model, we compute the fitness distribution of 
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the most connected node. Figure |3| shows that, as the 
network grows, nodes with increasing fitness become the 
ones with maximum degree. This is to be expected since 
the growth of the degree of a node increases over time 
as a power law with an exponent proportional to its fit- 
ness [po|| . Indeed, as the network grows, the distribution 
converges logarithmically to a delta function at ?/ = 1. 
Based on this fact, we may then suppose that the failure 
to collapse the data in Fig. ||(b) may be a consequence of 
the slow progressive entry into the system of nodes with 
larger fitnesses which eventually overcome older nodes 
that had the largest degree. 

In order to test this hypothesis, we consider an addi- 
tional case of heterogeneous nodes where the two first 
sites of the growing network (i = 1 and 2) are set to have 
fitness one, while all other nodes have fitnesses drawn 
from a uniform distribution. This case implies that one 
of the two first sites will become the node with maxi- 
mum degree and that the distribution of r) max is a delta 
function at one. 

We calculate the cumulative degree distribution for 
this case and find that the distribution displays a power 
law decay followed by a short plateau just before the 
exponential truncation; cf. Fig [|(a). We find that the 
distributions obtained for the different network sizes can 
be collapsed according to Eq. (Q) with the exponents 
a = 1.27 ± 0.05 and 9 « 0.78 ± 0.04 @. We also 
calculate the distribution of maxima and find that the 
distributions for different S can be collapsed upon the 
rescaling K' = S~ 7 K with 7 w 0.78. The distributions 
are consistent with the Gumbel statistics in the region 
around the most probable maxima. Moreover, our esti- 



mate of 7 is in agreement with the value we obtain for 
the truncation of the power law regime of the cumula- 
tive degree distribution, 9 w 0.78 ± 0.04. These results 
support our conjecture that the absence of good scaling 
in the distribution of maxima for the scale-free model of 
Ref. [^0) is due to the progressive entry of nodes with 
larger fitness. 

The major finding of this study is that the distribution 
of maxima for scale-free models has non trivial proper- 
ties. For the case of homogeneous nodes — i.e., nodes with 
identical fitness — we find that the distribution of max- 
ima follows Gumbel statistics with parameters related to 
the exponent a characterizing the degree distribution. 
We explain this finding by the exponential truncation of 
p(k) due to finite network size. In contrast, for scale-free 
models with heterogeneous nodes having fitnesses drawn 
from an uniform distribution, we find no scaling of the 
distribution of maxima. We explain this lack of scaling 
in terms of the progressive entry of nodes with larger fit- 
ness which over time will establish more links than nodes 
with lower fitness that entered the system earlier. Sur- 
prisingly, our results for this case of heterogeneous nodes 
are not inconsistent with the possibility that the asymp- 
totic distribution of maxima follows the Frechet statistics 
even though p(k) is exponentially-truncated due to finite 
network size. 
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